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HOMOLOGOUS RECOMBINATION IN MISMATCH REPAIR INACTIVATED EUKARYOTIC CELLS 



BACKGROUND OF THE INVENTION . 
Introduction 

The introduction of specific modifications in the prokaryotic and eukaryotic genome is a 
powerful tool in studying gene function both at the level of individual cells and in the context 
of a complete organism. In addition, modification of specific genes may result in the 
generation of industrially and medically important organisms, whereas correction of defective 
alleles in eukaryotic cells may provide a substantial step forward in the development of 
somatic gene-therapy protocols. 

The method of genetic modification relies on the ability of virtually every cell type to 
exchange DNA sequences with a high degree of nucleotide sequence similarity by a 
process which is called homologous recombination (1). Briefly, the method of genetic 
modification involves the generation of a so-called targeting construct, a DNA sequence that 
is largely identical to the specific chromosomal locus to be modified, but differing from this 
locus by specific modifications. These modifications can be as small as the deletion, 
insertion or substitution of a single base-pair, or be as large as the deletion or insertion of 
ten's of kilobasepairs. On entry of the targeting construct into the cell, exchange of 
sequences flanking the modification with their chromosomal counterparts, will result in the 
introduction of the modification into the recipient chromosome. 

The efficiency of homologous recombination in both prokaryotes and eukaryotes strongly 
depends on complete sequence identity of exchanging DNA strands (2-5). Thus, sequence 
dissimilarities as small as 0.5% can already strongly impede homologous recombination. In 
several bacterial species, Escherichia coll Salmonella typhimurium and Streptococcus 
pneumoniae as well as in the yeast Saccharomyces cerevisiae it has been unequivocally 
shown that the DNA mismatch repair system is responsible for suppressing recombination 
between homologous but nonidentical DNA sequences (6-8). 

We have now demonstrated that in mammalian cells deficient for the DNA mismatch repair 
gene M$h2, homologous recombination has lost the requirement for complete sequence 
identity between exchanging DNA sequences. This finding provides a new method for 
modifying the eukaryotic genome using DNA targeting constructs which substantially differ 
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from the target locus in the region where recombination takes place by genetically and/or 
functionally inactivating the cell's mismatch correction system. 

The method obviates the requirement for DNA targeting constructs that are largely identical 
to the target locus, thus allowing the efficient genetic modification of both somatic cells and 
the germ line of outbred organisms. This may provide a route to somatic gene therapy and 
the modification of eukaryotic species of which inbred strains are not easily available. Also, 
genomic sequences can be replaced by small oligonucleotides carrying one or more 
basepair alterations or by large chromosomal sequences derived from other species. The 
method can also be used to generate large deletions by intra- or extrachromosomal 
homologous recombination between repeated but diverged sequences. 

Genetic modification of mice 

The introduction of specific genetic alterations into the germ line of mice was made possible 
by a combination of new techniques in molecular biology and embryology that have 
become available during the last ten years (9-11). The method entails the introduction of a 
planned genetic modification in embryonic stem (ES) cells by homologous recombination. 
ES cells are originally derived from the inner cell mass of 3.5 day old pre-implantation 
embryos and can be maintained by in vitro culture as immortalized cell lines, retaining the 
undifferentiated state under appropriate culture conditions. At present, a number of ES cell 
lines are available. ES cells injected into the blastocoel of 3.5 day blastocyst-stage 
embryos can efficiently compete with the inner-cell-mass cells of the recipient blastocyst in 
embryonic development thus generating a chimeric mouse consisting of cells derived from 
the recipient blastocyst and the injected ES cells. Should the in vitro modified ES cells 
contribute to the germ line of chimeric animals, the ES cell genome can be transmitted to the 
next generation giving animals carrying the introduced modification in all their cells. 
Intercrossing of such animals reveals the phenotypic consequences of homozygosity of the 
modified gene. 

The most commonly introduced genetic modification in ES cells is gene disruption leading to 
gene inactivation. This allows to study gene function by analyzing the consequences of the 
absence of a particular gene in the context of a complete organism. Also, many hereditary 
human diseases result from hemizygosity of specific genes [e.g. cancer predisposition 
syndromes like hereditary nonpoiyposis colorectal cancer (12), Li Fraumeni syndrome (13), 
retinoblastoma (14)]. The generation of mouse models for such diseases by disruption of 
the mouse homologs of the genes involved provides an invaluable tool for studying 
hereditary diseases in an experimental setting. 
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Gene disruption in ES cells 

Thelnactivation of a specific gene in ES cells via homologous recombination starts with the 
preparation of a DNA targeting construct (15). Two types of targeting constructs can be 
used. In a replacement-type vector, a drug-resistance marker gene (conferring to the cell 
resistance to drugs like G418, Hygromycin B, Puromycin, Histidinol) disrupts a sequence 
which is homologous to a sequence in or around the target gene in the recipient genome; in 
an insertion-type vector the marker gene flanks the homologous sequence. The targeting 
construct is introduced into ES cells by electroporation (or alternatively by Ca-Phosphate 
precipitation, lyposome-mediated DNA transfer or micro-injection) and cells are selected for 
stable integration of the targeting construct into the recipient genome by growth in medium 
containing the appropiate drug. Drug-resistant colonies are the result of either one of two 
events: integration of the targeting construct at a random site in the genome or integration of 
the marker gene in the target locus via homologous recombination between the flanking 
sequences and their chromosomal counterparts. The replacement-type vector integrates by 
homologous recombination on both sides of the marker gene; the insertion-type vector 
integrates by a single homologous recombination event leading to duplication of the region 
of homology. Thus, the marker gene serves two purposes: it allows selection of cells that 
have taken up the targeting construct and, on integration via homologous recombination, it 
will disrupt the target gene thereby modifying its function. 

Distinguishing ES cell clones resulting from homologous recombination from those resulting 
from random integration, requires the DNA of individual clones to be analysed by Southern 
hybridisation or the polymerase chain reaction. 

Unfortunately, in many targeting experiments, random integration was often found to be far 
more efficient than homologous recombination and also large variations in targeting efficiency 
were observed for different genes (16). In this respect, mammalian cells differ from bacteria 
and lower eukaryotes like yeast (17), Leishmania major (18) or Trypanosoma b/i/ce/(19), 
where integration of exogenous DNA into the recipient genome exclusively or predominantly 
occurs via homologous recombination. To date, three factors clearly affecting the recovery of 
homologous recombinants in mammalian cells have been identified. First, the frequency of 
homologous recombination increases substantially with the total length of the homologous 
sequences up to 14 kilobase-pairs (20). Second, the expression level of the marker gene at 
the target locus affects the frequency of recovery of homologous recombinants: low 
expression may lead to loss of homologous recombinants, whereas high expression may 
allow selection of homologous recombinants at an elevated drug concentration (21). Third, 
sequence dissimilarities between the targeting construct and the chromosomal target locus 
strongly suppress the efficiency of homologous recombination (5). 
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High efficiency targeting with isogenic DNA constructs 

The suppression of homologous recombination in ES cells by small sequence dissimilarities 
became clear by a gene targeting experiment aimed at disrupting the Retinoblastoma gene 
with a neomycin resistance marker gene. Two targeting constructs were prepared carrying 
the neomycin resistance marker embedded in 10.5 kb of Resequence. In one construct the 
Rb sequence was derived from mouse strain 129, and was therefore identical to the 
corresponding chromosomal locus in the ES cells, which were also derived from mouse 
strain 129. In the other construct, the Rb sequence was derived from mouse strain BALB/c. 
The two constructs, designated 129Rb-neo and B/cRb-neo are described in the 
accompanying publication Proc. Natl. Acad. ScL USA, Vol. 89, pp. 5128-5132 on pages 
5128 and 5129, Fig. 1 .The two constructs contained corresponding Rb sequences and 
were therefore largely similar. However, they differed approximately 0.6% at the nucleotide 
level (which corresponds to the level of sequence polymorphism found in the human 
population). Thus, in a stretch of 1687 basepairs that was sequenced, the BALB/c 
sequence differed from the 129 sequence by 9 basepair substitutions, three small deletions 
(of 1,4 and 6 nucleotides) and two polymorphic CA-repeats (see Proc. Natl. Acad. Sci. 
USA, Vol. 89, pp. 5128-5132, page 5130, Fig. 4). On introduction of these constructs in 
129-derived ES cells, homologous recombination at Rb with the 129-derived construct was 
50-fold more efficient than with the nonisogenic BALB/c-derived construct. To provide 
additional evidence that the suppression of recombination was solely dependent on the 
polymorphisms between the endogenous locus and the targeting DNA, the inverse 
experiment was performed, i.e. targeting of a BALB/c-derived ES cell line with the 129- and 
BALB/c-derived constructs. This experiment yielded the inverse result, i.e. a higher 
targeting efficiency with the BALB/c-derived construct than with the nonisogenic 129- 
derived construct. 

With a somewhat different targeting construct, consisting of a hygromycin resistance gene 
embedded in 17 kb of isogenic Rb DNA, we observed that 80% of all Hygromycin B- 
resistant colonies resulted from homologous recombination (5). This demonstrates that, in 
the presence of perfect homology, also in mammalian cells homologous recombination rather 
than random integration can be the predominant event. 

Although clearly not all problems of gene targeting have been solved, many genes have 
now been successfully targeted by the use of isogenic DNA targeting constructs. However, 
genetic modification of cells derved from an outbred organism can become a difficult 
endeavour as isogenic targeting constructs are not easily available. In this case, efficient 
gene targeting would require the targeting construct to be prepared from DNA derived from 
the target cell. Especially in the context of gene therapy, this would raise a tremendous 
practical obstacle to correction of a defective gene. Also, base sequence divergence 
imposes a major barrier to exchanging a large chromosomal region of one species by the 
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syntenic region of another species. The present invention provides a way to overcome 
these problems. 

The introduction of subtle mutations 

Although protocols for disruption of genes in inbred ES cell lines (and in somatic cell lines of which 
isogenic DNA targeting constructs can be prepared) are rather well developed, the introduction of 
more subtle mutations is not straightforward. Current protocols are variations on a two-step 
procedure in which first a marker gene is introduced into the target gene followed by replacement of 
the marker gene by the desired subtle mutation (5). This procedure requires the marker gene to be 
selectable both for its presence (first step) and its absence (replacement by the subtle mutation). 

Useful marker genes are the Hprt minigene to be used in Hp/T-deficient ES cells (positive selection 
in HAT medium; negative selection by 6-thioguanine) and a combination of the neomycin resistance 
gene (positive selection by G418) and the Herpes Simplex Virus thymidine kinase gene (negative 
selection by Gancyclovir). In an alternative procedure, the subtle mutation and the marker gene are 
present on the same targeting construct and concomitantly introduced into the genome by 
homologous recombination. If an insertion-type vector was used, the marker gene can be removed 
by intrachromosomal recombination between the duplicated sequences that were generated during 
the first integration event (23). In case of a replacement-type vector, the marker gene can be 
removed if it was flanked by two site-specific-recombination sites (e.g. toxP sites). Recombination 
between these sites on introduction into the cell of the foxP-specific recombinase Cre will remove 
the marker gene from the genome (24). 

Although either of the above mentioned procedures has allowed the subtle modification of a number 
of genes in ES cells, they are highly demanding as to the generation of appropiate DNA targeting 
constructs and the culturing of ES cells under various selective conditions. 
Therefore, an attractive alternative to these procedures might be the use of small single- or double- 
stranded oligonucleotides (up to 100 bases or basepairs), which are identical to the target locus 
except for one or several basepair alterations. However, our finding that base sequence 
dissimilarities as small as 0.6% strongly suppress homologous recombination, may impose a major 
impediment to using such oligonucleotides for the introduction of subtle genetic modifications. The 
present invention provides a way to overcome this problem and may allow the subtle modification 
of cell lines and cells derived from living organisms and temporarily cultured in vitro. 
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GENETIC MODIFICATION OF EUKARYOTIC CELLS USING DNA CONSTRUCT* 
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DESCRIPTION OF THE INVENTION 

The manuscript included as annex in this application provides a detailed description of 
the methodology used in this application and will be published in the journal Cell on July 
28, 1995. 

As described above, base sequence dissimilarities as modest as 0.6% impose a strong barrier to 
efficient homologous recombination in mouse embiyonic stem cells (5). Suppression of homologous 
recombination by small base sequence divergencies was earlier observed in bacteria (E. coli, S. 
typhimuhum and S. pneumoniae), yeast and mouse fibroblasts (2-4). The role of the DNA mismatch 
correction system in suppressing recombination between homologous but diverged sequences, 
was most dramatically demonstrated by Radman and coworkers in studying bacterial conjugatioh 
between the related but diverged species Escherichia coli and Salmonella typhimurium (6). This 
process relies on entry of chromosomal fragments of one species into the other and recombination of 
these fragments with the chomosome of the recipient bacterium. The sequence divergence between 
the two species is estimated to be 20-30% and therefore the recovery of exconjugants from an 
interspecies cross is about 2x105 fold lower than from an intraspecies cross. However, the 
recovery of exconjugants from the interspecies cross increased 3x103 fold jf the recipient bacteria 
carried an inactivating mutation in either the mutS or mutL gene, both of which being essential for 
DNA mismatch correction. 

The central protein in DNA mismatch correction in Ecoli is encoded by the mutS gene (25). It 
recognizes and binds to base mispairs and small loops of up to four unpaired nucleotides After 
binding to heteroduplex DNA, the MutS-DNA complex is bound by the mutL gene product which 
leads to excision of a tract of single-stranded DNA of up to several kilobases that contains the 
mispaired nucieotide(s). In this process also the mutUgene product plays a role, whereas the mutH 
gene product ensures removal of newly synthesized strands rather than parental strands. The 
repair process is completed by resynthesis of the excised strand and ligation of the remaining nick. 
Mismatch repair in E. coli is responsible for maintaining genome stability in at least two ways: i) by 
recognizing and repairing mis- and unpaired nucleotides that occur, respectively, by 
misincorporatjon and slippage during DNA replication; ii) by recognizing mismatches occurring in 
het roduplexes formed at initial stages of recombination between homologous but not-identical 
s quences. This may either lead to blocking elongation of heteroduplex formation or dissociation of 
the heteroduplex thus aborting the recombination reaction. Consequently, E co//strains defective 
for either mutS or mutL have a pleiotropic phenotype: an increased mutation rate, including 
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destabiiization of simple-sequence repeats and an increased rate of recombination between 
homologous but diverged DNA sequences. The latter phenotype is clearly manifested by the 
efficient recovery of recombinant bacteria resulting from conjugational crosses between the related 
but diverged species Escherichia coli and Salmonella typhimurium wherein the recipient bacterium 
was deficient for mutS or mutL (6). Also, the frequency of chromosomal rearrangements by ectopic 
recombination between diverged sequences is substantially elevated in mismatch repair deficient 
bacteria (26). 

In many respects, the biochemistry of mismatch repair systems in eukaryotes resembles 
that of the £ coli mutS f L system. Homologs of both genes have been identified in yeast 
and mammalian cells. Based on mismatch binding in vitro, and on the mutator and 
recombinator phenotypes of Saccharomyces cerevisiae mutants, the protein encoded by 
the yeast MSH2 gene seems to be the functional homolog of MutS (27-29). A homolog of 
the yeast MSH2 gene was identified in mammalian cells by analysis of a GT-mismatch- 
binding activity, positional cloning and PCR amplification of mouse DNA using degenerate 
primers (30). Similarly, homologs of the £ coli mutL gene were identified in yeast and 
mammalian cells. 

Interestingly, inherited mutations in human mutS and mutL homologs were recently found to 
be related to the cancer predisposition syndrome HNPCC (hereditary nonpolyposis 
colorectal cancer), which is characterized by development of tumors of the proximal colon at 
early age. In these tumors, mismatch repair is lost, as manifested by destabiiization of 
simple sequence repeats, the replication error-positive (RER+) phenotype (12). 

In order to investigate the role of the mammalian Msh2qene in DNA mismatch repair and to 
address the role of mismatch repair in maintaining genome stability, we generated an ES cell 
line carrying a disruption in both copies of the mouse MSH2 gene. This line is disignated 
dMsh2-9. Its construction is described in the accompanying manuscript: 
"Inactivation of the mouse Msh2 gene results in mismatch repair deficiency, 
methylation tolerance, hyperrecombination, and predisposition to cancer" , by 
Niels de Wind, Marleen Dekker, Anton Berns, Miroslav Radman, and Hein te Riele, 
to be published in Cell, the 28th July, 1995 on pages 6, 14, 27 (legend to Fig.1) and 30 
(Fig. 1). 

As demonstrated in the manuscript, the phenotypic consequences of Msh2 deficiency in 
mouse embryonic stem cells provide clear evidence for an essential role of MSH2 in 
mammalian DNA mismatch repair. First, Ateh2-deficient ES cells lack binding activity to a 
double stranded 38-mer oligonucleotide carrying a GT mismatch or an unpaired TG 
dinucleotide (pages 6, 15, 27 (legend to Fig.2) and 30 (Fig.2)]. Second, /Wsft2-deficient ES 
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ceHs have a mutator phenotype as evidenced by an at least 150-fold increase in the 
number of cells resistant to 6-thioguanine, indicating mutational inactivation of the X-linked 
Hprt gene. Moreover, microsatellite length instability was observed in subclones derived 
from the Ms/?2-deficient ES cell line, but not in subclones derived from wild-type ES cells 
[pages 7, 15, 27 (legend to Fig.3) and 30 (Fig.3)]. Third, M>/)2-deficient ES cells resisted a 
20-fold higher concentration of the methylating agent N-Methyl-N'-Nitro-N-Nitrosoguanidine 
than wild-type ES cells [pages 7/8, 16, 27 (legend to Fig.4) and 31 (Fig. 4)]. Fourth, mice 
bred to homozygosity for the disrupted Msh2 allele, originally generated in ES cells, were 
highly predisposed to tumorigenesis [pages 9, 14/15, and 29 (Table 2)]. 
Thus, disruption of Msh2 leads to inactivation of mammalian DNA mismatch repair. 

By using a gene targeting assay, we demonstrated that it is the mammalian DNA mismatch 
repair system which is responsible for suppressing homologous recombination between 
sequences differing as little as 0.6% at the nucleotide level. As described above, 
homologous recombination at the Rb locus with an isogenic DNA targeting construct was 
50-fold more efficient than with a similar, but nonisogenic construct containing on the 
average 0.6% sequence divergence with respect to the target locus (5). However, in 
Ms/72-deficient ES cells, homologous recombination at Rb with the nonisogenic targeting 
construct was as efficient as with the isogenic construct [see the accompanying manuscript: 
•Inactivation of the mouse Msh2 gene results in mismatch repair deficiency, 
methylation tolerance, hyperrecombination, and predisposition to cancer" , by 
Niels de Wind, Marleen Dekker, Anton Bems. Miroslav Radman, and Hein te Riele [pages 
8/9, 16, 28 (Table 1)]. 

This experiment demonstrates that it is the mammalian DNA mismatch repair system which 
is involved in preventing recombination between homologous but diverged DNA 
sequences. 



This finding provides a method for modifying the mammalian genome but also the genome of 
any other eukaryotic organism via homologous recombination using DNA targeting 
sequences that substantially differ from the target locus with respect to the nucleotide 
sequence, by genetic or functional inactivation of the cell's mismatch repair system. 
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GENETIC MODIFICATION OF EUKARYOTIC CELLS USING DNA CONSTRUCTS 
WITH SUBSTANTIAL BASE SEQUENCE DIFFERENCES WITH RESPECT TO THE 
TARGET LOCUS. 
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This manuscript is part of the application and contains a detailed description 
of the methodology as described in the application. This manuscript will be 
published in the journal Cell on July 28, 1995. 
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Summary 

To investigate the role of the presumed DNA mismatch repair (MMR) gene 
Msh2 in genome stability and tumorigenesis, we have generated cells and 
mice that are deficient for the gene. MsA 2 -deficient cells have lost mismatch 
binding, and have acquired microsatellite instability, a mutator phenotype 
and tolerance to mediating agents. Moreover, in these cells, homologous 
recombination has lost dependence on complete identity between interacting 
DNA sequences, suggesting that Msh2 is involved in safeguarding the 
genome from promiscuous recombination. Ms ^-deficient mice display no 
major abnormalities, but a significant fraction develops lymphomas at early 
age. Thus, Msh2 is involved in MMR, controlling several aspects of genome 
stability; loss of MMR-controIled genome stability predisposes to cancer 
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Introduction 

Cancer is the endpoint of an evolutionary process in which normal cells acquire full 
malignancy by cumulative genetic or epi-genetic alterations, each conferring proliferative, 
invasive or metastatic potential to the cell. In colorectal cancer, at least six independent 
transforming mutations in oncogenes or tumor suppressor genes are required for a cell to 
develop into a fully malignant tumor (reviewed by Vogelstein and Kinzler, 1993). This, 
evolutionary process might be dramatically accelerated by the early loss of systems that 
safeguard genome stability, resulting in an increased rate of mutagenesis and chromosomal 
rearrangements (Hartwell, 1992 and Loeb, 1994). In support of this hypothesis, a number of 
hereditary cancer predisposition syndromes are attributed to defects in DNA repair. Two of 
these are Hereditary Nonpolyposis Colorectal Cancer (HNPCC) and Muir-Torre syndrome, 
which are characterized by the development of tumors of the proximal colon at early age. 
Also cancers of other parts of the gastrointestinal tract and of the genitourinary tract are often 
found (Lynch et al., 1993). Muir-Torre patients are, in addition, prone to cancers of the 
sebaceous glands and the skin (Cohen et al., 1991). Individuals affected by one of these' 
syndromes have most likely inherited a mutant allele of either one of four mismatch-repair 
(MMR)-related genes called MSH2 , MLH1, PMS1 or PMS2 (Leach et al., 1993; Bronner 
et al., 1994; Kolodner et al., 1994; Liu et al., 1994, Nicolaides et al., 1994; Papadopouios et 
al., 1994; Kolodner et al., 1995; reviewed by Jiricny, 1994 and by Modrich, 1994). In 
tumors of these patients, MMR is lost . presumably through inactivation or loss of the wild- 
type allele (Leach et al., 1993; Nicolaides et al., 1994; Papadopouios et al., 1994; Hernrninki 
et al. 1994). The mechanisms of loss of MMR, the cause of the apparent tissue specificity of 
tumorigenesis and the role of loss of MMR in the evolution of a normal cell to fully malignant 
tumors are still obscure. Moreover, recent data suggest that MMR is frequently lost in many 
types of sporadic tumors (Han et al., 1993; Parsons et al., 1993; Risinger et al., 1994; 
Chong et al., 1994; Mironov et al., 1994; Orth et al., 1994; Shibata et al., 1994; Suzuki et 
al., 1994; Umar et al., 1994b; Wada et al., 1994; Wooster et al., 1994; Liu et al., 1995). 
This supports the notion that loss of MMR may play a general role in tumor evolution. 
The best studied MMR system with respect to both genetics and biochemistry is the mutSL 
pathway of Escherichia coli (reviewed by Modrich, 1991). Central in this pathway is the 
protein encoded by the mutS gene, which binds to base mispairs and to loops of up to four 
unpaired nucleotides. After binding to the heteroduplex, the MutS-DNA complex is bound by 
the mutL gene product, triggering excision of a tract of single-stranded DNA of up to several 
kilobases that contains the mismatched nucleotide(s). The repair process is completed by 
resynthesis of the excised DNA strand and ligation of the remaining nick. MMR in E. coli is 
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responsible for maintaining genome integrity in at least two ways: (i) by recognizing and re- 
pairing mispaired and extrahelical nucleotides that occur by, respectively, misincorporation 
and slippage during replication and (ii) by recognizing mismatches occurring in 
recombination intermediates between homologous but diverged sequences, leading to 
abortion of recombination. Consequently, E. coli strains deficient for either mutS or mutL 
have a pleiotropic phenotype: an increased mutation rate, including destabilization of simple- 
sequence repeats (also called microsatellites; the 'Replication Error 1 ", or RER+ phenotype) ; 
and promiscuity in inter- and intragenomic recombination (the recombinator phenotype, 
Rayssiguier et al., 1989; Petit et al., 1991). 

In many respects the biochemistry of MMR systems in eukaryotes resembles that of the 
mutSL system (Holmes et al., 1990; Varlet et al., 1990; Thomas et al., 1991; Fang and 
Modrich, 1993). Homologs of both mutS and mutL were found in eukaryotes. Based on 
mismatch binding in vitro and on the mutator and recombinator phenotypes of 
Saccharomyces cerevisiae mutants, the protein encoded by the yeast MSH2 gene (Reenan 
and Kolodner, 1992a) seems to be the functional homolog of MutS (Reenan and Kolodner, 
1992b; Miret et al, 1993; Alani et al, 1995). In higher eukaryotes (Xenopus laevis, mouse' 
and human) homologs of the yeast MSH2 gene have recently been identified, based on either 
sequence homology, direct sequencing of the major mismatch-binding protein or by 
positional cloning (Fishel et al., 1993; Leach et al., 1993; Palombo et al., 1994; Varlet et al., 
1994). An apparent difference between MutS and MSH2, is the binding of the eukaryotic 
protein, when overproduced and purified from E. coli or yeast, to stretches of up to 14 
extrahelical nucleotides (so called insertion-deletion-type loops, or IDLs) in vitro (Fishel et 
al., 1994b; Alani et al., 1995), suggestive of a role for Msh2 in the repair of these loops 
(Umar et al, 1994). Eukaryotic homologs of the mutL gene, called MLH1, PMS1 and PMS2 
have been analyzed in yeast (Kramer et al., 1989a; Kramer et al., 1989b; Strand et al., 1993; 
Prolla et al., 1994a; Prolla et al., 1994b) and human cells (Li and Modrich, 1995). 
The precise role of these mutS and mutL homologs in MMR and the role of MMR in the 
maintenance of genome stability in mammalian cells remain to be established. In addition, the 
causative role of MMR in cellular sensitivity to simple methylating agents, as was suggested 
by the loss of MMR after selection of cells for tolerance to these agents (Branch et al., 1993; 
Kat et al., 1993) and by. the methylation tolerance of a MMR-deficient tumor cell line (Koi et 
al., 1994) remains to be explored. 

To investigate the involvement of Msh2 in mammalian MMR and the roles of MMR in 
maintaining genome stability we have generated mouse Embryonic Stem (ES) cell lines 
which lack both copies of the Msh2 gene. We show that Mr/i2-deficient cells lack binding to 
some specific mismatches, have acquired a mutator phenotype and are tolerant to simple 
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methylating agents. Moreover, these cells have lost the barrier to recombination between 
homologous but diverged sequences. Taken together, these results suggest that MMR is 
essential for the preservation of multiple aspects of genome stability in mammalian cells. 
To study the effect of MMR deficiency on the etiology and evolution of cancer we generated 
mice lacking one or both copies of the Msh2 gene in part or all of their cells. Although 
heterozygous mice, a model for HNPCC, remain healthy to at least 8 months of age, a 
significant fraction of both chimeric and fully homozygous Msh2 knock-out mice succumbs 
to lymphomas. These results provide evidence that loss of MMR-controlled genome stability 
can accelerate the development of cancer. 
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Results 

Targeted disruption of the Msh2 gene in mouse ES cells 

Heterozygous ES cell lines sMsh2-42 and sMsh2-55, carrying a hygromycin-resistance gene 
inserted between codons 588 and 589 of one allele of the Msh2 gene, were generated by gene 
targeting using the construct shown in Fig. I A. Cell line dMsh2-9, carrying the hyg marker 
in both alleles of the Msb.2 gene, was obtained by selecting cell line sM<r/z2-55 for duplication 
of the chromosome carrying the targeted Msh2 allele at an elevated hygromycin B 
concentration, a method that was previously shown to be effective using the neomycin- 
resistance marker (Mortensen et al, 1992). Targeting events were detected by Southern blot 
analysis, using probes flanking both sides of the targeting construct (Fig. 1A). An example 
of such a Southern blot, demonstrating the status of the Msh2 alleles in cell lines wt-2 (a 
control cell line with the targeting construct integrated by non-homologous recombination), 
sMsh2-55 and AMsh2-9 is shown in Fig. IB. Diploidy of cell lines sMsh2-55 and AMsh2-9 
was verified by karyotyping (not shown). 

The hyg marker was inserted, in the antisense orientation, N-terminally of the putative ATP 
binding site (Fig. 1A) in a region where (in the human gene) truncations were found in many 
HNPCC tumors (Liu et ai., 1994). Moreover, no Msh2 transcript could be detected in ES 
cell line dM^2-9 by Northern blotting (not shown), indicating that the hyg marker strongly 
suppressed Msh2 expression. The genotype of ES cell line 6Msh2-9 will therefore be 
indicated by Msh2-I-. 

Mismatch binding in Msh2-/. cells 

Two different mismatch-binding activities have been described in mammalian cell extracts 
using gel-retardation assays: (i) An activity that recognises G.T mismatches (Jiricny et al., 
1988; Stephenson and Karran, 1989; Griffin and Karran, 1993), an extrahelical TG 
dinucleotide (Aquilina et al; 1 994) representing an intermediate in replicational slippage of a 
microsatellite and, weakly, G.A, G.G, A.C and G.U mismatches (Stephenson and Karran, 
1989; Hughes and Jiricny, 1992). (ii) In one cell line, an independent activity was detected 
that recognizes A.C mismatches and also pyrimidine pyrimidine mismatches (Stephenson 
and Karran, 1989). Based on protein sequencing, the purified G.T-binding activity was 
shown to contain the MSH2 protein (Palombo et al., 1994). To characterize the mismatch- 
binding properties of the Msfi2-I- ES cell line, we have performed gel-retardation assays 
using cell extracts of wild-type ES cell line wt-2 and M^-deficient line dMsh2-9. Extracts 
were incubated with radiolabeled 38-mer double-stranded oligonucleotides containing either 
a mismatch, an extrahelical dinucleotide, or a 14-nucleotide insertion-deletion-type loop 
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(IDL), followed by electrophoresis on a nondenaturing poly aery lamide gel. Results of these 
experiments as shown in Fig. 2 clearly demonstrate binding activity in wild-type cell extracts 
to a G.T mismatch and to an extrahelical TG dinucleotide. Binding to both oligonucleotides 
was entirely absent in the extract of the Msh2-f- ES cell line, demonstrating the involvement 
of the Msh2 protein in this binding activity. As shown before (Hughes and Jiricny, 1992), 
binding was abolished by the inclusion of ATP in the binding reaction (not shown). We were 
unable to detect specific binding to G.A, G.G, or A.C mismatches under the conditions used . 
in wild-type or mutant cell extracts. In addition, no Msh2-dependent binding to a 14- 
nucleotide IDL was observed (Fig. 2). 

Msh2-f- cells have a mutator phenotype 

The Msh2-I- cell line was indistinguishable from the wild-type cell line wt-2 and the parental 
heterozygous cell line sMsh2-55 with respect to growth and plating efficiency (not shown). 
To address the role of Msh2 in MMR, we determined the stability of two microsatellite 
markers in Msh2-I- cell line 6Msh2-9 and Afj/z2+/+ line wt-2. For this purpose, 24 
subclones were derived of each cell line, which had undergone approximately 20 celf 
divisions since its generation. Chromosomal DNA, isolated from each expanded subclone 
was subjected to the polymerase chain reaction (PCR) using two end-labelled primer pairs 
(D7MUI7 and D14Mitl5\ Dietrich et al., 1994). Whereas none of the wt-2 sublines showed 
any alteration in microsatellite length, clear length alterations in many 6Msh2-9 sublines were 
seen for both microsatellites (8 out of 24 for marker D7Mitl 7 and 6 out of 24 for marker 
D14Mitl5, Fig.3). This observation strongly suggests that the Msh2 protein is involved in 
the repair of slipped replication intermediates. The microsatellite-slippage rate was estimated 
to be 10" 2 to 10" 3 per generation. 

We subsequently investigated the effect of loss of the Msh2 gene on the mutation frequency 
of a functional gene. To this purpose, 6xl0 6 cells of cell lines wt-2 and dMsh2-9 were plated 
in the presence of 6-thioguanine (6-TG) to select for cells that have lost activity of the X- 
linked Hprt gene by mutation. Whereas no 6-TG-resistant colonies were seen in the wt-2 
culture, 188 resistant colonies were present in the dMsh2-9 culture. This result extends the 
mutator phenotype of Msh2-I- cells to functional genes. 

Msh2-I- cells are tolerant to N-methyl-N'-nitro-N-nitrosoguanidine (MNNG) 
MMR was suggested to mediate hypersensitivity to agents that methylate G residues at the O 6 
position. It is believed that MMR recognises the 0 6 -meG.T mispair that occurs after 
erroneous incorporation of a thymidine nucleotide opposite to 0 6 -meG during replication 
(Griffin et al., 1994). This is supposedly followed by excision and repair synthesis, again 
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•incorporating thymidine opposite to 06-meG, triggering a new round of MMR. The net result 
of this cycling between excision and resynthesis will be the presence of single-stranded 
regions at the site of 06- me G residues which will lead to double-stranded gaps when 
replicated during S phase, resulting in cell death (for reviews, see Karran and Bignami, 
1992; Karran and Bignami, 1994). It is predicted that loss of MMR will prevent this process,' 
thus conferring tolerance to simple methylating agents. To test this hypothesis directly' 
survival of Mshl-I- cell line dMsh2-9, Msh+I- cell line sMsh2-2l and wild-type cell line wt-2 
was determined after exposure to a range of concentrations of N-methyl-N'-nitro-N- 
nitrosoguamdine (MNNG). In the growth medium, O'-benzylguanine was included to 
competitively inhibit endogenous methyltransferase activity that might otherwise remove the 
methyl groups added by MNNG (Dolan et al., 1990). Fig. 4 shows that the LD 50 for 
MNNG was increased 20-fold in the Msh2-I- cell line compared to the heterozygous and 
w,ld-type cell lines, directly proving the involvement of Msh2 in determining cellular 
sensitivity to MNNG. Heterozygous cell line sMsh2-2\ displayed no increased tolerance to 



MNNG. 



Loss of suppression of recombination between diverged sequences in 
Msh2-I- ceils 

In E. coli (Shen and Huang, 1986), Drosophila (Nassif and Engels, 1993) and mammalian 
cells (Waldman and Liskay, 1988, Te Riele et al, 1992), the efficiency of recombination 
between homologous DNA stretches is highly dependent on their sequence identity. We have 
previously demonstrated that in a gene targeting assay, homologous recombination at the 
Retinoblastoma (Rb) locus in ES cells, derived from mouse strain 129, was 50-fold more 
efficient with a 129-derived targeting construct than with a construct derived from a non- 
isogenic (BALB/c) mouse strain (Te Riele et al, 1992; Table 1). This construct contained 
0.6% base sequence divergence with respect to the isogenic 129 construct. To provide 
additional evidence that suppression of recombination was solely dependent on the 
polymorphisms between the endogenous locus and the targeting DNA, we performed the 
inverse experiment, i.e. targeting of a BALB/c-derived ES cell line with the 129- and 
BALB/c-derived constructs. This experiment yielded the inverse result, i.e. a higher targeting 
efficiency with the BALB/c targeting construct than with the non-isogenic 129 targeting 
construct (Table 1). To investigate whether MMR is responsible for this antirecombination 
effect, we have repeated the Rb targeting experiment in the (129-derived) ES cell line 6Msh2- 
9. We found that in the Msh2-I- cell line homologous recombination at the Rb locus with the 
non-isogenic construct was as efficient as with the isogenic construct (Table 1). This 
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experiment demonstrates that Msh2 is involved in preventing homologous recombination 
between diverged DNA sequences. 

Msh2-deficient mice are viable but predisposed to tumorigenesis 

To study the effect of MMR deficiency on the development of cancer, Msh2+I- ES cell lines 
sMsh2-55 and sMsh2-42 and Msh2-I- ES cell line 6Msh2-9 were used to generate chimeric 
mice. Both the Msh2 single and double knock-out ES cells gave rise to healthy animals with 
coat-color chimerism of 20 to 70 %. This result suggests that MMR deficiency did not alter 
the capacity of the ES cells to compete with wild-type cells in chimeric development. 
Msh2+I- chimeras were found to transmit the mutant allele through the germ line resulting in 
Msh2 heterozygous Fl mice. These mice, which may serve as a model for HNPCC, were 
healthy up to at least 8 months of age (Table 2). 

Intercrossing of Msh2+/- mice resulted in Msh2-I- mice according to a normal Mendelian 
distribution; they were healthy at birth and fertile. However, 30% of the Msh2-/~ mice 
developed metastasizing lymphomas of T-cell origin with a peak at 2 months of age (Table 
2). In addition, one mouse suffered from generalized histiocytic sarcoma at 3*5 months of' 
age. Extensive histological analyses of these mice did not reveal any other abnormality. Also 
healthy Msh2-/- mice sacrificed at 2 months of age were histologically normal and had 
normal B- and T-cell populations as evidenced by FACS analysis. 
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Discussion 

Lack of mismatch binding in Msh2-I- ceil extracts 

Whereas wild-type cell extracts display efficient binding to both a G.T mismatch as well as to 
an extrahelicai TG dinucleotide, extracts of the ES cell line 6Msh2-9 lack this binding (Fig. 
2), suggesting that the Msh2 protein is essential for binding to these heteroduplexes. We 
were unable to detect specific binding activity to other mismatches under the conditions used. : 
This result is compatible with binding studies by others which demonstrate that binding to a 
G T mismatch is stronger than to other mismatches (Stephenson and Karran, 1989; Hughes 
and Jiricny, 1992) whereas the independent A.C binding activity is apparently cell-type 
specific (Jiricny et al., 1988; Stephenson and Karran, 1989). The methylation-tolerant cell 
lines 'clone B' and RajiF12 were also found to lack binding to both a G.T mismatch (Branch 
et al., 1993; Griffin et al., 1994) and to an extrahelicai TG dinucleotide (Aquilina et al., 
1994). Binding to an A.C mismatch was unaffected (Branch et al., 1993). Therefore, these 
cell lines may have lost the Msh2 gene. Purified human MSH2, overproduced in E. coli, and 
purified yeast MSH2 were shown to bind to G.T mismatches (Fishel et al., 1994a; Alani et' 
al., 1995), but also strongly to IDLs of up to 14 nucleotides (Fishel et al., 1994b; Alani et 
al., 1995). Surprizingiy, we could not detect Msh2-dependent binding to a 14 nucleotide IDL 
in ES cell extracts. We are unable to explain this apparent discrepancy. In contrast to the 
binding that we observed in ES cell extracts and that of the GTBP protein, the MSH2- 
containing G.T mismatch-binding protein fraction purified from HeLa cells (Hughes and 
Jiricny, 1992; Palombo et al., 1994), the binding capacity of the purified MSH2 proteins was 
not abolished by ATP (Fishel et al., 1994b; Alani et al., 1995). This suggests that the 
purified MSH2 proteins behave differently from the naturally produced protein. The finding 
that in Msh2-/- cells length shifts in microsatellites were never larger than 4 nucleotides 
(representing two CA dinucleotides; Fig. 3), suggests that binding activity of Msh2 to 
extrahelicai loops longer than 4 nucleotides is not required to maintain microsatellite stability. 
We are currently initiating in vitro repair assays using wild-type and Msh2-I- cell extracts to 
further elucidate this point. 

Mutator phenotype of A/s/i 2 -deficient cells 

Cell line W sh2-9 had acquired microsatellite instability (the RER+ phenotype), indicating 
failure to repair slipped replication intermediates. The frequency of microsatellite instability 
was similar to that of some RER+ tumor cell lines (Parsons et al., 1993; Bhattacharyya et al., 
1994). The mutator phenotype of these cells also affects functional genes as evidenced by the 
rate of Hprt deficiency in ^dMsh2-9 culture which was increased from undetectable (i.e. 
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lower than L6xl 0" 7 ) for wild-type cells to 3xlO- 5 . This frequency is in agreement with that 
found in some RER+ tumor cell lines (Bhattacharyya et ah, 1994; Shibata et ah, 1994; 
Eshleman et ah, 1995), although cell lines that have been selected for MMR deficiency using 
a methylating agent tend to have somewhat lower mutation frequencies (Goldmacher et ah, 
1986; Branch et ah, 1993; Kat et ah, 1993; Aquilina et ah, 1994). Growth rate and plating 
efficiency of Mj/z2-deficient, heterozygous and wild-type cells were indistinguishable, 
suggesting that the mutator phenotype does not interfere with viability of the cells. This is 
remarkable, since both were seriously affected in some cell lines that were selected for MMR 
deficiency using methylating agents (Goldmacher et ah, 1986). This suggests that in the latter 
cells, additional events may have taken place that interfere with cell viability. Based on the 
mutator phenotype of dMsh2-9 cells and on the binding studies described above, we 
conclude that Msh2 is involved in, and essential for, postreplicational MMR. 

Sensitivity to simple methylating agents is mediated by MMR 

Sensitivity of cells to agents that methylate DNA at the O 6 position of G residues has been, 
attributed to futile attempts of MMR to correct the resulting 0 6 -meG.T basepairs (see 
results). This hypothesis is supported by two observations: (i) Prolonged selection of cells 
for tolerance to Methylnitrosourea or MNNG frequently results in loss of MMR (Kat et ah, 

1993) and loss of G.T-binding activity (Branch et ah, 1993; Griffin et ah, 1994). (ii) An 
RER 4 * tumor cell line was found to be tolerant to these agents and sensitivity was restored by 
introduction of the chromosome containing a wild-type allele of the defective mismatch-repair 
gene (Koi et ah, 1994). Our finding that Ms/z2-deficient cell line dMsh2-9 had acquired 
tolerance to MNNG provides direct evidence in support of this hypothesis. This finding may 
have important clinical implications. The triazene class of carcinostatic drugs probably 
derives its cytotoxic effect from methylation at the O 6 position of G residues (Michejda et ah, 

1994) . Whereas MMR-deficient tumors are supposedly tolerant to these drugs, they may 
elicit severe cytotoxity in normal tissues of the patient. In addition, the persistence of 06- 
meG.T mispairs in MMR-deficient tumor cells might lead to increased G to A transitions 
(Aquilina et al., 1993), resulting in acceleration of tumor progression. Finally, the finding 
that simple methylating agents select, or induce, MMR deficiency in cells has "led to the 
hypothesis that this process is involved in determining the tisssue specificity of tumorigenesis 
in HNPCC patients, due to the presence of such agents in the intestine (Karran and Bignami, 
1994; Varletetah, 1994). ; 
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JVIMR edits homologous recombination in mammalian cells 

Previously, it has been shown that the presence of less than 1% nucleotide difference can 
strongly suppress homologous recombination in mammalian cells (Waldman and Liskay, 
1988, Te Riele et al., 1992). Here, we have extended these results and we have demonstrated 
that Msh2-I- cells have lost this heterology-dependent suppression of recombination. Based 
on results obtained in E. coli (Worth et al., 1994), this function is probably mediated by the 
Msh2 protein, by recognizing mismatches that occur during heteroduplex formation between 
two not perfectly complementary DNA strands. This may directly destabilize the 
heteroduplex or, alternatively, prevent elongation of strand exchange. Similar results have 
recently been obtained with MS#2-deficient yeast strains (Alani et al., 1994; Selva et al., 
1995), suggesting that this role of MMR is strongly preserved during evolution. In E. coli, 
loss of MMR has been shown to result in a high frequency of inter- and intrachromosomal 
rearrangements due to ectopic recombination between diverged sequences (Rayssiguier et al., 
1989; Petit et al., 1991). More than the E. coli genome, mammalian genomes contain many 
repeated but diverged sequences. Examples of these are gene families, SINE sequences, 
LINE sequences and pseudogenes (Brosius and Gould, 1992; Nowak, 1994). Ectopic' 
recombination between these sequences would lead to genomic rearrangements that may 
either be detrimental for cell viability due to loss of essential genomic information or, 
alternatively, lead to activation of oncogenes or inactivation of tumor suppressor genes 
(Bouffler et al., 1993). Homologous recombination between identical DNA stretches should 
be allowed to proceed, however, to allow recombinational repair of double strand breaks 
using the (identical) sister chromatid as sequence donor. For this reason, mechanisms 
counteracting homologous recombination when both recombining stretches contain 
divergencies, while permitting recombination to proceed when both stretches are identical, 
may be of great importance for the mammalian cell to enable repair without risking genome 
rearrangment. Based on the effect of M^2-deficiency on gene targeting, it is likely that 
MMR plays this scrutinizing role. 

Our finding that Msh2 acts not only anti-mutagenic but also anti-recombinogenic, suggests 
that MMR deficiency may contribute to tumorigenesis by elevating the rate of both point 
mutations and chromosomal rearrangements. 

Role of Msh2 deficiency in tumorigenesis 

A^/tf-deficiency did not interfere with mouse embryonic development and is compatible with 
adult life. This observation substantiates the recent finding that some HNPCC patients seem 
largely MMR deficient in normal cells (Parsons et al., 1995). A significant fraction of 
chimeric and pure Msh2-I : mice developed metastasizing lymphomas. It is likely that the 
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-developing immune system in newborn mice, in which a high turnover of maturing T cells 
exists, offers a window where MMR deficiency can strongly accelerate the accumulation of 
transforming events. 

The basal level of mutagenesis in the cell, estimated to be approximately 10- 10 per nucleotide 
per generation, is considered to be insufficient to sustain the multistep clonal outgrowth of 
cells in tumorigenesis (Loeb, 1994). Therefore an increased mutation rate, such as caused by 
MMR deficiency, is likely to be obligatory to accelerate this process (Loeb, 1994). Our. 
results suggest that a rapid cell expansion is required to unveil the effect of MMR deficiency 
on tumor development. Corroborating this notion, recent studies of microsatellite instability 
during cancer development show that loss of MMR probably does not occur as the initial 
step, but rather within an early tumor stage such as an adenoma (Young et al., 1993; 
Aaltonen et al., 1994; Chong et al., 1994; Shibata et al., 1994; Wada et al., 1994), which, in 
HNPCC patients, have been found to evolve rapidly into carcinomas (Lynch et al., 1993; 
Jass et al., 1994). Moreover, two recent studies support the notion that enhanced 
mutagenesis due to MMR deficiency is responsible for tumor progression: (i): Some RER+ 
tumors had acquired frameshift mutations in the pYMicroglobulin gene within a CT repeat,' 
indicative of unrepaired replication slippage, and putatively leading to escape from immune 
surveillance (Bicknell et al., 1994); (ii): in some RER+ colon carcinomas from HNPCC 
patients, p53 and APC were found to contain either frameshifts in repeated nucleotides or G 
to A transitions, indicative of the failure to repair slipped simple sequence repeats and G.T 
mismatches, respectively (Lazar et al., 1994). 

In conclusion, our studies on Msh2 deficiency prove the involvement of Msh2 in MMR and 
show that MMR functions as a mechanism safeguarding the mammalian genome against 
mutation by misincorporation and recombination between diverged sequences. In addition, 
our results lend support to the hypothesis that MMR deficiency can strongly accelerate 
malignant transformation of rapidly expanding cell populations. 
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Experimental procedures 
Disruption of Msh2 in mouse ES cells 

Genomic Msh2 fragments were obtained by screening a 1290LA-derived genomic DNA 
library with a murine Msh2 cDNA probe (Varlet et al M 1994). The targeting construct was 
prepared by subcloning a 12.5 kbp BamU I fragment and inserting a hygromycin resistance 
gene (from PGK/ryg, te Riele et al., 1990) into the unique SnaB I site located within an exon 
sequence of Msh2. Cloning procedures were performed according to Sambrook et al. 
(1989). 

The targeting construct was seperated from vector sequences by gel electrophoresis, purified 
by electro elution and introduced into 1290LA-derived ES cell line E14 by electroporation as 
described (te Riele et al., 1992). Electroporated cells were seeded onto gelatin-coated 10-cm 
dishes (10 7 cells per plate) and subjected to hygromycin B selection (150 fig per ml) in BRL- 
conditioned medium (Hooper et al., 1987) the following day. After 10 days, individual, 
hygromycin B-resistant colonies were randomly picked and expanded on mouse embryonic 
fibroblasts feeder layers. DNA was extracted from expanded colonies, digested with EcoR I 
and analyzed by Southern hybridization using probes flanking both sides of the targeting 
construct (Fig. 1A). Two cell lines were obtained out of 135 hygromycin B-resistant 
colonies showing bands diagnostic for correct integration of the hyg marker between codons 
588 and 589 of one copy of the Msh2 gene. These cell lines were designated sMsh2-42 and 
sA/5/*2-55. One cell line obtained from this experiment, designated wt-2, carrying a randomly 
integrated hyg gene, was used as an Msh2+f+ control. 

To obtain an ES cell line carrying a disruption in both copies of Msh2 y 10 6 cells of cell line 
sM sh2-55 were plated onto 10-cm plates and cultured in BRL-conditioned medium 
containing 1.0 or 1.5 mg per ml of hygromycin B. After 12 days of of culturing in selective 
medium and 7 days in non-selective medium, 24 colonies were obtained which had survived 
1.5 mg per ml of hygromycin B. DNA was extracted from expanded colonies, digested with 
EcoR I and analyzed by Southern hybridization. One cell line, designated dMsh2-9 y carried 
two disrupted Msh2 alleles, had lost the wild-type copy and contained the normal number of 
chromosomes. ES cell lines which had survived 1.0 mg per ml of hygromycin B still 
contained one disrupted and one wild-type Msh2 copy. One of these Msh2+I- lines was 
designated sMsh2-2\ and used for several experiments. 

Generation of Msh2 mutant mice 

Chimeric mice were obtained by injecting 10-15 cells of ES cell lines sMsh2-55, sMsh2A2 
and dMsh2~9 into C57B1/6 blastocysts. Male chimeras obtained with Msh2^' ES cells were 
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crossed with wild-type 1290LA and FVB mice and found to transmit the mutated Msh2 allele 
through the germ line. Homozygous Msh2 mutant mice were obtained by intercrossing Fl 
heterozygotes. 

Gel shift assay 

Preparation of cell extracts, annealing of oligonucleotides, binding of cell extracts to duplex 
oligonucleotides containing mismatched or extrahelical nucleotides, and nondenaturing poly- 
acrylamide gelelectrophoresis were performed essentially as described (Stephenson and 
Karran, 1989). However, gelelectrophoresis was performed in TAE buffer rather than in 
TBE buffer. To obtain duplex oligonucleotides, the oligonucleotide U: 5-GGGAAGCT- 
GCCAGGCCCCAGTGTCAGCCTCCTATGCTC-3' (sequences were derived from 
Aquilina et al., 1994) was radiolabeled and annealed with any of the following unlabelled 
oligonucleotides: L-G.T: 5'-GAGCATAGGAGGCTGACATTGGGGCCTGGCAGCTT- 
CCC-3' (resulting in a G.T mismatch); L-G.A: 5 -GAGCATAGGAGGCTGACAATGG- 
GGCCTGGCAGCTTCCC-3' (resulting in a G.A mismatch); L-G.G: 5-GAGCATAGG 
AGGCTGACAGTGGGGCCTGGCAGCTTCCC-3' (resulting in a G.G mismatch);' 
L-A.C: 5'-GAGCATAGGAGGCTGACACCGGGGCCTGGCAGCTTCCC-3' (resulting 
in an A.C mismatch); L-TG: 5'-GAGCATAGGAGGCTGACACTGTGGGGCCTGGCA- 
GCTTCCC-3 , (resulting in an extrahelical TG dinucieotide); L-HOM: 5-GAGCATAGGA- 
GGCTGACACTGGGGCCTGGCAGCTTCCC-3' (resulting in a homoduplex); 
L-LOOP14: 5'-GAGCATAGGAGGCTGACACATACGTGAGTACTCTGGGGCCTGG- 
CAGCTTCCC-3' (resulting in an IDL loop of 14 extrahelical nucleotides). In all assays, a 
twofold excess of unlabelled homoduplex competitor oligonucleotide was included. As a 
positive control, a duplex oligonucleotide containing the binding site for the E2F family of 
transcription factors was used (Beijersbergen et al., 1995). 

PCR amplification of microsatellites 

Subclones of ES cell lines dMsh2-9 and wt-2 were generated by seeding cells onto mouse 
embryonic fibroblasts feeder layers at a density of 10 3 cells per 10 cm 2 . At that time, the ES 
cell lines were in culture for approximately 20 divisions since their generation. Twenty-four 
colonies from each cell line were expanded. Chromosomal DNA was isolated and subjected 
to the polymerase chain reaction using two end-labelled primer pairs (DJ4MU15 and 
D7MU17, Dietrich et al., 1994-). Amplified products were electrophoresed on a denaturing 
poly aery lamide gel. 
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.Mutation frequency 

6xl0 6 cells of ES cell lines Amsh2-9 and wt-2 were plated onto 486 cm 2 gelatin-coated 
tissue culture surface in BRL-conditioned medium. After two days, 6-Thioguanine was 
added at a concentration of 10 ng per ml. After two weeks the number of resistant colonies 
was counted. 

Sensitivity to MNNG 

ES cell lines dMsh2-9, sMsh2-2l and wt-2 were seeded onto MEF feeder layers at a density 
of 10 3 cells per 4 cm 2 - The following day, cells were exposed for one hour to MNNG 
ranging in concentration from zero to 36.45 u.M in serum-free medium. After 4 days of 
incubation, cells were trypsinized, stained with Trypan blue and counted. During the whole 
procedure from one hour before exposure to MNNG, 0 6 -benzylguanine (20 u,M, kindly 
provided by A. Pegg) was included in the medium in order to competively inhibit 
endogenous methyltransferase activity that might otherwise remove the methyl groups added 
by MNNG (Dolan et al, 1990). 

Rb targeting with isogenic and non-isogenic DNA 

Targeting and subsequent analysis of the Rb locus in a BALB/c-derived ES cell line (kindly 
provided by S. Rastan) and 129/OLA-derived ES cell lines dMsh2-9 were performed 
essentially as described (te Riele et al, 1992). The targeting constructs \29Rb-neo (derived 
from the 1290LA genome) and B/cRb-neo (derived from the BALB/c genome) carry an 
insertion of the pMClneo marker in exon 19 of the Rb gene and differ approximately 0.6% at 
the nucleotide level (te Riele et al, 1992). 
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Legends 

Fig. 1. Targeted disruption of the mouse Msh2 gene. 

A: Top: schematic diagram of the mouse Msh2 protein. ATP binding and HTH: putative 
ATPase and helix turn helix domains, respectively (Varlet et al, 1994). The position at which 
the protein is disrupted by insertion of the Hyg marker is indicated. Bottom: the genomic 
Msh2 locus; indicated below is the targeting construct carrying the Hyg marker inserted into a, 
unique SnaB I site within an exon sequence of Msh2. The positions of the external probes 
used for detection of targeting events are indicated. 

B: The Mshl locus in Msh2+/+ cell line wt-2, Msh2+/- cell line sMsh2-55 and Msh2-A cell 
line dM?/i2-9. Genomic DNA of these cell lines was digested with EcoRl and analyzed by 
Southern hybridization using probe 1 (Fig. 1A). Arrowheads indicate the positions of the 
wild type (wt, 20 kbp), and disrupted (ko, 7.5 kbp) alleles. 

Fig. 2. Mismatch-binding activity in wild-type and Ms/i2-deficient cells. 
Binding activity in extracts of Msh2+l+ cell line wt-2 and Msh2-/- cell line dmsh2-9 to 
double-stranded oligonucleotides containing a mismatch (G.T, G-A, G.G, or AX), an 
extrahelical dinucleotide (Extrahelical TG), or an extrahelical 14-nucleotide loop (LOOP 
14) and a Homoduplex was assessed using a gel retardation assay. Sequences of the 
oligonucleotides are given under Experimental Procedures. Arrowheads indicate the positions 
of mismatch-specific complexes. E2F: Binding to an oligonucleotide carrying an E2F site 
served as a positive control (Beijersbergen et al, 1995). 

Fig. 3. Microsatellite stability in wild-type and Msh2 -deficient cells. 

Genomic DNA of 24 subclones derived from My/i2+/+ cell line wt-2 and Msh2-I- cell line 
dmsh2-9 was amplified by PCR using primer sets D7MU17 and D14MU15. Arrowheads 
indicate microsatellite alleles with a detectable length alteration. 

Fig. 4. Tolerance of Ms/i2-deficient cells to the simple methylating agent MNNG. 
ES cell lines wt-2 (M$/z2+/+), $Msh2-55 (Msh2+/-) and dMsh2-9 (Msh2-/-) were exposed to 
increasing amounts of MNNG for one hour in the presence of 0 6 -benzylguanine. After four 
days of incubation, cells were trypsinized and counted. 
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Table 1 . Homologous recombination has lost dependence on sequence identity in Msh2 
deficient ES cells. 



ES cells 



Homologous recombination 
vs total no. G418 R col. with 



Isogenic vs 
non-isogenic 



129f?£>-neo 



B/cRb-neo 



BALB/c (Msh2+J+) 



1/68 (1.5%) 



16/72 (22%) 



15x 



1290LA (Msh2+/+) 



33/94 (35%) 



1/144 (0.7%) 



50 x 



1290LA (Msh2-I-) 



42/185 (23%) 



47/184 (26%) 



0.9 x 



\29Rb-neo and B/cRb-neo are Rb targeting constructs, prepared from the 129 and BALB/c 
strains of mice, respectively, with 0.6% sequence divergency. Targeting frequencies in 
1290LA (Msh2+/+) ES cells are derived from te Riele et al., 1992. 
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Genotype Age Condition Tumor incidence 

(months) at birth 

type frequency** age 

(months) 



Msh2+f- 8 healthy 0/110 

Msh2+/+::Msh2-f-* 6 healthy Lymphoma 1/24 2 

Msh2-I- 4-5 healthy Lymphoma 6/19 2-4 

Histiocytic 1/19 3.5 
sarcoma 



♦Chimeras consisting of wild-type and Msh2-I- cells 

♦♦Number of mice carrying a tumor per total number in the experiment 
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GENETIC MODIFICATION OF EUKARYOTIC CELLS USING DNA CONSTRUCTS 
WITH SUBSTANTIAL BASE SEQUENCE DIFFERENCES WITH RESPECT TO THE 
TARGET LOCUS. 



CLAIMS 

1 . A method for modifying the genome of a eukaryotic cell by homologous recombination 
between a specific locus in the genome and a DNA sequence introduced into a cell in which 
the mismatch repair system has been genetically and/or transitorily inactivated and said 
sequence having up to 30% base sequence differences with respect to the target locus in 
the region where recombination can take place or said sequence differs from the target locus 
to such a level that homologous recombination is normally strongly impeded. 

2. A method according to claim 1 wherein the cell's mismatch repair system is inactivated by 
disruption of both copies of one of the genes essential for DNA mismatch repair. 

3. A method according to claim 1 wherein the cell's mismatch repair system is inactivated by 
introduction of antisense RNA constructs, driven by an appropriate promoter, thus 
functionally inactivating one of the genes essential for DNA mismatch repair. 

4. A method according to claim 1 wherein the cell's mismatch repair system is inactivated by 
introduction of modified antisense oligodeoxynucleotides, thus functionally inactivating one 
of the genes essential for DNA mismatch repair. 

5. A method according to claim 1 wherein the cell's mismatch repair system is inactivated by 
expression of a dominant negativily acting version of a gene involved in DNA mismatch 
repair. 

6. A method according to daim 1 wherein the cell's mismatch repair system is inactivated 
transitorily by saturation of the mismatch repair system by the introduction into the cell of 
DNA molecules carrying one or more mis- or unpaired bases. 

7. A method according to daihris 1 -6 wherein the target cell is an embryonic stem cell or a 
related cell type derived from any mammalian species, to be used to generate chimeric 
animals and chimeric animals that can transmit the mutated genome through the germ line. 

8. A method according to claims 1-6 wherein the target cell is a cell line derived from any 
mammalian species and cultured in vitro. 
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9. A method according to claims 1-6 wherein the target cell is derived from any organ of any 
mammalian species. 

10. A method according to claim 9 wherein the modified target cell can be reintroduced into 
the organism. 

1 1 . A method according to claims 1-6 wherein the target ceil is a plant cell. . 

12. A method according to claims 1-6 wherein the target cell is derived from outbred 
organisms implicating that they contain nonidentical copies of homologous chromosomes. 

13. A method according to claim 1 wherein the targeting DNA at the region where 
homologous recombination can take place is derived from the same species as the target 
cell. 

14. A method according to claim 13 wherein the targeting DNA is modified in vitro at specific 
sites thus deliberately introducing sequence divergence. 

15. A method according to claim 1 wherein the targeting DNA at the region where 
homologous recombination can take place is derived from the same species as the target cell 
but is nonisogenic with the DNA of the target cell, the two sequences differing up to 5% at 
the nucleotide level. 

16. A method according to claim 1 wherein the targeting DNA at the region where 
homologous recombination can take place is derived from another species as the target cell, 
the targeting sequence and the target locus comprising up to 30% sequence divergence in 
the region where homologous recombination can take place. 

17. A method according to claim 16 wherein the targeting DNA is a chromosomal DNA 
fragment carried on a YAC or cosmid vector. 

18. A method according to claim 1 wherein the targeting DNA is an double- or single- 
stranded oligonucleotide consisting of 10-100 bases or basepairs of which one or several 
bases or basepairs differ from the target locus. 

19. A method according Xq claim 1 wherein the targeting DNA is constructed in such a way to 
allow the generation of deletions of chromosomal regions as follows: 

- The targeting construct consists of any selectable marker gene flanked by two sequences 
which can recombine with chromosomal loci: one flanking sequ nee being identical or highly 
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. homologous (>95% sequence identity) to a sequence of the genome of the target cell, the 
other flanking sequence being a so called .repetitive sequence (e.g. Alu, LINE, SINE) of 
which numerous diverged copies are spread over the genome. 

- The targeting construct consists of any selectable marker gene flanked by two sequences, 
one being a sequence that can act as a telomere, the other flanking sequence being a so 
called repetitive sequence (e.g. Alu, LINE, SINE) of which numerous diverged copies are 
spread over the genome. 

20. A method to test inactivation of the cellular mismatch repair system related to claims 
2,3,4,5,6 using a gene targeting assay involving comparison of the targeting efficiency of 
isogenic versus nonisogenic DNA targeting constructs. 

21 . A method to test inactivation of the cellular mismatch repair system related to claims 
2,3,4,5,6 using an assay involving intrachromosomal recombination between homologous 
but diverged DNA sequences. 

22. A method to test homologous recombination related to claims 13,14,15,16,17,18,19 using 
an Embryonic Stem cell line or any other cell line in which the mismatch repair system is 
inactived by disruption of both copies of the Msh2 gene or another mismatch repair gene. 

23. A method related to claims 8,9,10 in order to derive cells or cell lines from mismatch repair 
defecient mice carrying a disruption in both copies of the Msh2 gene or another mismatch 
repair gene. 
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